Can Confidence Scores Post-editing Speech Recog
Authors
Abstract
When dictating with speech recognition, most of the user’s time is spent correcting errors. To reduce this burden, we propose new editor functions designed specifically to speed up the correction process. The idea is to use a recognition confidence measure to predict which words are likely to be in error, to display that information to the user by highlighting suspect words, and to provide a command that lets the user jump the cursor to the next suspect word. Simple experiments suggest that these functions can be valuable, even with today’s speech recognizers and confidence measures.

1. MOTIVATION AND PROPOSAL

Speech dictation software, although increasingly popular, is still not in wide use. One reason is the need to correct errors. To estimate the magnitude of this problem, we had a few subjects enter the same short passage in Japanese using one of the best commercial dictation systems. Although speaking the text was 3 or 4 times faster than keying it, once the time spent correcting errors was included, there was only a small speed advantage. About 60% of the total dictation time was spent correcting errors. Correcting an error takes three steps:
Publication date: 2002